# Off-Policy Learning
Tqc PandaPickAndPlace V1
This is a deep reinforcement learning model based on the TQC algorithm, specifically designed for the PandaPickAndPlace-v1 environment, used for robotic arm grasping and placing tasks.
Molecular Model
T
sb3
14
7
Sac Walker2d V3
This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.
Physics Model
S
sb3
43
0
Dqn MountainCar V0
This is a DQN agent model trained using stable-baselines3, specifically designed to solve reinforcement learning tasks in the MountainCar-v0 environment.
Molecular Model
D
sb3
578
1
Decision Transformer Gym Hopper Medium
This is a decision transformer model trained on medium-performance trajectories in the Gym Hopper environment, suitable for continuous control tasks.
Physics Model
Transformers

D
edbeeching
6,518
6
Featured Recommended AI Models